skip to main content


Search for: All records

Creators/Authors contains: "Li, Sha"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. The application of high-power, few-cycle, long-wave infrared (LWIR, 8–20 µm) pulses in strong-field physics is largely unexplored due to the lack of suitable sources. However, the generation of intense pulses with >6 µm wavelength range is becoming increasingly feasible with the recent advances in high-power ultrashort lasers in the middle-infrared range that can serve as a pump for optical parametric amplifiers (OPA). Here we experimentally demonstrate the feasibility of this approach by building an OPA pumped at 2.4 µm that generates 93 µJ pulses at 9.5 µm, 1 kHz repetition rate with sub-two-cycle pulse duration, 1.6 GW peak power, and excellent beam quality. The results open a wide range of applications in attosecond physics (especially for studies of condensed phase samples), remote sensing, and biophotonics.

     
    more » « less
  2. Event schemas are a form of world knowledge about the typical progression of events. Recent methods for event schema induction use information extraction systems to construct a large number of event graph instances from documents, and then learn to generalize the schema from such instances. In contrast, we propose to treat event schemas as a form of commonsense knowledge that can be derived from large language models (LLMs). This new paradigm greatly simplifies the schema induction process and allows us to handle both hierarchical relations and temporal relations between events in a straightforward way. Since event schemas have complex graph structures, we design an incremental prompting and verification method INCPROMPT to break down the construction of a complex event graph into three stages: event skeleton construction, event expansion, and event-event relation verification. Compared to directly using LLMs to generate a linearized graph, INCPROMPT can generate large and complex schemas with 7.2% F1 improvement in temporal relations and 31.0% F1 improvement in hierarchical relations. In addition, compared to the previous state-of-the-art closed-domain schema induction model, human assessors were able to cover ∼10% more events when translating the schemas into coherent stories and rated our schemas 1.3 points higher (on a 5-point scale) in terms of readability. 
    more » « less
  3. Abstract

    Studies of laser-driven strong field processes subjected to a (quasi-)static field have been mainly confined to theory. Here we provide an experimental realization by introducing a bichromatic approach for high harmonic generation (HHG) in a dielectric that combines an intense 70 femtosecond duration mid-infrared driving field with a weak 2 picosecond period terahertz (THz) dressing field. We address the physics underlying the THz field induced static symmetry breaking and its consequences on the efficient production/suppression of even-/odd-order harmonics, and demonstrate the ability to probe the HHG dynamics via the modulation of the harmonic distribution. Moreover, we report a delay-dependent even-order harmonic frequency shift that is proportional to the time derivative of the THz field. This suggests a limitation of the static symmetry breaking interpretation and implies that the resultant attosecond bursts are aperiodic, thus providing a frequency domain probe of attosecond transients while opening opportunities in precise attosecond pulse shaping.

     
    more » « less
  4. Abstract

    We performed a rigorous reverberation-mapping analysis of the broad-line region (BLR) in a highly accreting (L/LEdd= 0.74–3.4) active galactic nucleus, Markarian 142 (Mrk 142), for the first time using concurrent observations of the inner accretion disk and the BLR to determine a time lag for the Hβλ4861 emission relative to the ultraviolet (UV) continuum variations. We used continuum data taken with the Niel Gehrels Swift Observatory in theUVW2 band, and the Las Cumbres Observatory, Dan Zowada Memorial Observatory, and Liverpool Telescope in thegband, as part of the broader Mrk 142 multiwavelength monitoring campaign in 2019. We obtained new spectroscopic observations covering the Hβbroad emission line in the optical from the Gemini North Telescope and the Lijiang 2.4 m Telescope for a total of 102 epochs (over a period of 8 months) contemporaneous to the continuum data. Our primary result states a UV-to-Hβtime lag of8.680.72+0.75days in Mrk 142 obtained from light-curve analysis with a Python-based running optimal average algorithm. We placed our new measurements for Mrk 142 on the optical and UV radius–luminosity relations for NGC 5548 to understand the nature of the continuum driver. The positions of Mrk 142 on the scaling relations suggest that UV is closer to the “true” driving continuum than the optical. Furthermore, we obtainlog(M/M)= 6.32 ± 0.29 assuming UV as the primary driving continuum.

     
    more » « less
  5. Schema induction builds a graph representation explaining how events unfold in a scenario. Existing approaches have been based on information retrieval (IR) and information extraction (IE), often with limited human curation. We demonstrate a human-in-the-loop schema induction system powered by GPT-3. We first describe the different modules of our system, including prompting to generate schematic elements, manual edit of those elements, and conversion of those into a schema graph. By qualitatively comparing our system to previous ones, we show that our system not only transfers to new domains more easily than previous approaches, but also reduces efforts of human curation thanks to our interactive interface. 
    more » « less
  6. null (Ed.)
    Event extraction has long been treated as a sentence-level task in the IE community. We argue that this setting does not match human information seeking behavior and leads to incomplete and uninformative extraction results. We propose a document-level neural event argument extraction model by formulating the task as conditional generation following event templates. We also compile a new document-level event extraction benchmark dataset WIKIEVENTS which includes complete event and coreference annotation. On the task of argument extraction, we achieve an absolute gain of 7.6% F1 and 5.7% F1 over the next best model on the RAMS and WIKIEVENTS datasets respectively. On the more challenging task of informative argument extraction, which requires implicit coreference reasoning, we achieve a 9.3% F1 gain over the best baseline. To demonstrate the portability of our model, we also create the first end-to-end zero-shot event extraction framework and achieve 97% of fully supervised model’s trigger extraction performance and 82% of the argument extraction performance given only access to 10 out of the 33 types on ACE. 
    more » « less
  7. null (Ed.)

    Graph Neural Networks (GNNs) have been shown to be powerful in a wide range of graph-related tasks. While there exists various GNN models, a critical common ingredient is neighborhood aggregation, where the embedding of each node is updated by referring to the embedding of its neighbors. This paper aims to provide a better understanding of this mechanisms by asking the following question: Is neighborhood aggregation always necessary and beneficial? In short, the answer is no. We carve out two conditions under which neighborhood aggregation is not helpful: (1) when a node's neighbors are highly dissimilar and (2) when a node's embedding is already similar with that of its neighbors. We propose novel metrics that quantitatively measure these two circumstances and integrate them into an Adaptive-layer module. Our experiments show that allowing for node-specific aggregation degrees have significant advantage over current GNNs.

     
    more » « less
  8. The automated construction of topic taxonomies can benefit numerous applications, including web search, recommendation, and knowledge discovery. One of the major advantages of automatic taxonomy construction is the ability to capture corpus-specific information and adapt to different scenarios. To better reflect the characteristics of a corpus, we take the meta-data of documents into consideration and view the corpus as a text-rich network. In this paper, we propose NetTaxo, a novel automatic topic taxonomy construction framework, which goes beyond the existing paradigm and allows text data to collaborate with network structure. Specifically, we learn term embeddings from both text and network as contexts. Network motifs are adopted to capture appropriate network contexts. We conduct an instance-level selection for motifs, which further refines term embedding according to the granularity and semantics of each taxonomy node. Clustering is then applied to obtain sub-topics under a taxonomy node. Extensive experiments on two real-world datasets demonstrate the superiority of our method over the state-of-the-art, and further verify the effectiveness and importance of instance-level motif selection. 
    more » « less
  9. null (Ed.)